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Absjract 

In  most  instances,  the  dynamic  response  of  monetary  and  other  policies  to  shocks  is  infrequent  and 
lumpy.  The  same  holds  for  the  microeconomic  response  of  some  of  the  most  important  economic  vari- 
ables, such  as  investment,  labor  demand,  and  prices.  We  show  that  the  standard  practice  of  estimating 
the  speed  of  adjustment  of  such  variables  with  partial-adjustment  ARM  A  procedures  substantially  over- 
estimates this  speed.  For  example,  for  the  target  federal  funds  rate,  we  find  that  the  actual  response  to 
shocks  is  less  than  half  as  fast  as  the  estimated  response.  For  investment,  labor  demand  and  prices,  the 
speed  of  adjustment  inferred  from  aggregates  of  a  small  number  of  agents  is  likely  to  be  close  to  instanta- 
neous. While  aggregating  across  microeconomic  units  reduces  the  bias  (the  limit  of  which  is  illustrated 
by  Rotemberg's  widely  used  linear  aggregate  characterization  of  Calvo's  model  of  sticky  prices),  in 
some  instances  convergence  is  extremely  slow.  For  example,  even  after  aggregating  investment  across 
all  establishments  in  U.S.  manufacturing,  the  estimate  of  its  speed  of  adjustment  to  shocks  is  biased 
upward  by  more  than  SO  percent.  While  the  bias  is  not  as  extreme  for  labor  demand  and  prices,  it  still 
remains  significant  at  high  levels  of  aggregation.  Because  the  bias  rises  with  disaggregation,  findings  of 
microeconomic  adjustment  that  is  substantially  faster  than  aggregate  adjustment  are  generally  suspect. 
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1     Introduction 

The  measurement  of  the  dynamic  response  of  economic  and  policy  variables  to  shocks  is  of  central  impor- 
tance in  macroeconomics.  Usually,  this  response  is  estimated  by  recovering  the  speed  at  which  the  variable 
of  interest  adjusts  from  a  linear  time-series  model.  In  this  paper  we  argue  that,  in  many  instances,  this 
procedure  significantly  underestimates  the  sluggishness  of  actual  adjustment. 

The  severity  of  this  bias  depends  on  how  infrequent  and  lumpy  the  adjustment  of  the  underlying  vari- 
able is.  In  the  case  of  single  policy  variables,  such  as  the  federal  funds  rate,  or  individual  microeconomic 
variables,  such  as  firm  level  investment,  this  bias  can  be  extreme.  If  no  source  of  persistence  other  than 
the  discrete  adjustment  exists,  we  show  that  regardless  of  how  sluggish  adjustment  may  be,  the  econometri- 
cian  estimating  linear  autoregressive  processes  (partial  adjustment  models)  will  erroneously  conclude  that 
adjustment  is  instantaneous. 

Aggregation  across  establishments  reduces  the  bias,  so  we  have  the  somewhat  unusual  situation  where 
estimates  of  a  microeconomic  parameter  using  aggregate  data  are  less  biased  than  those  based  upon  microe- 
conomic data.  Lumpiness  combined  with  linear  estimation  procedures  is  likely  to  give  the  false  impression 
that  microeconomic  adjustments  is  significantly  faster  than  aggregate  adjustment. 

We  also  show  that,  when  aggregating  across  units,  convergence  to  the  correct  estimate  of  the  speed  of  ad- 
justment is  extremely  slow.  For  example,  for  U.S.  manufacturing  investment,  even  after  aggregating  across 
all  the  continuous  establishments  in  the  LRD  (approximately  10,000  establishments),  estimates  of  speed  of 
adjustment  are  80  percent  higher  than  actual  speed  —  at  the  two-digit  level  the  bias  easily  can  exceed  400 
percent.  Similarly,  estimating  a  Calvo  (1983)  model  using  standard  partial-adjustment  techniques  is  likely 
to  underestimate  the  sluggishness  of  price  adjustments  severely.  This  is  consistent  with  the  recent  findings 
of  Bils  and  Klenow  (2002),  who  report  much  slower  speeds  of  adjustment  when  looking  at  individual  price 
adjustment  frequencies  than  when  estimating  the  speed  of  adjustment  with  linear  time-series  models.  We 
also  show  that  estimates  of  employment  adjustment  speed  experience  similar  biases. 

The  basic  intuition  underlying  our  main  results  is  the  following:  In  linear  models,  the  estimated  speed  of 
adjustment  is  inversely  related  to  the  degree  of  persistence  in  the  data.  That  is.  a  larger  first  order  correlation 
is  associated  with  lower  adjustment  speed.  Yet  this  correlation  is  always  zero  for  an  individual  series  that  is 
adjusted  discretely  (and  has  i.i.d.  shocks),  so  that  the  researcher  will  conclude,  incorrectly,  that  adjustment 
is  infinitely  fast.  To  see  that  this  crucial  correlation  is  zero,  first  note  that  the  product  of  current  and  lagged 
changes  in  the  variable  of  concern  is  zero  when  there  is  no  adjustment  in  either  the  current  or  the  preceding 
period.  This  means  that  any  non-zero  serial  correlation  must  come  from  realizations  in  which  the  unit 
adjusts  in  two  consecutive  periods.  But  when  the  unit  adjusts  in  two  consecutive  periods,  and  whenever  it 
acts  it  catches  up  with  all  accumulated  shocks  since  it  last  adjusted,  it  must  be  that  the  later  adjustment  only 
involves  the  latest  shock,  which  is  independent  from  the  shocks  included  in  the  previous  adjustment. 

The  bias  falls  as  aggregation  rises  because  the  correlations  at  leads  and  lags  of  the  adjustments  across 
individual  units  are  non-zero.  That  is,  the  common  components  in  the  adjustments  of  different  agents  at 
different  points  in  time  provides  the  correlation  that  allows  us  to  recover  the  microeconomic  speed  of  ad- 

1 


justment.  The  larger  this  common  component  is  — as  measured,  for  example,  by  the  variance  of  aggregate 
shocks  relative  to  the  variance  of  idiosyncratic  shocks —  the  faster  the  estimate  converges  to  its  true  value  as 
the  number  of  agents  grows.  In  practice,  the  variance  of  aggregate  shocks  is  significantly  smaller  than  that 
of  idiosyncratic  shocks,  and  convergence  takes  place  at  a  very  slow  pace. 

In  Section  2  we  study  the  bias  for  microeconomic  and  single-policy  variables,  and  illustrate  its  im- 
portance when  estimating  the  speed  of  adjustment  of  monetary  policy.  Section  3  presents  our  aggregation 
results  and  highlights  slow  convergence.  We  show  the  relevance  of  this  phenomenon  for  parameters  consis- 
tent with  those  of  investment,  labor,  and  price  adjustments  in  the  U.S.  Section  4  discusses  partial  solutions 
and  extensions.  It  first  extends  our  results  to  dynamic  equations  with  contemporaneous  regressors,  such  as 
those  used  in  price-wage  equations,  or  output-gap  inflation  models.  It  then  illustrates  an  ARMA  method  to 
reduce  the  extent  of  the  bias.  Section  5  concludes  and  is  followed  by  an  appendix  with  technical  details. 

2     Microeconomic  and  Single-Policy  Series 

When  the  task  of  a  researcher  is  to  estimate  the  speed  of  adjustment  of  a  state  variable  — or  the  implicit 
adjustment  costs  in  a  quadratic  adjustment  cost  model  (see  e.g.,  Sargent  1978.  Rotemberg  1987) —  the 
standard  procedure  reduces  to  estimating  variations  of  the  celebrated  partial  adjustment  model  (PAM): 

Ay,  =  X(y? ->,_,),  (1) 

where  y  and  y*  represent  the  actual  and  optimal  levels  of  the  variable  under  consideration  (e.g.,  prices, 
employment,  or  capital),  and  X  is  a  parameter  that  captures  the  extent  to  which  imbalances  are  remedied  in 
each  period.  Taking  first  differences  and  rearranging  terms  leads  to  the  best  known  form  of  PAM: 

Ay,  =  (l-X)Ay,_!+v,,  (2) 

with  v,  =A.Ay*. 

In  this  model,  X  is  thought  of  as  the  speed  of  adjustment,  while  the  expected  time  until  adjustment 
(defined  formally  in  Section  2)  is  (1  —  A.) /A.  Thus,  as  A.  converges  to  one,  adjustment  occurs  instantaneously, 
while  as  A  decreases,  adjustment  slows  down. 

Most  people  understand  that  this  model  is  only  meant  to  capture  the  first-order  dynamics  of  more  real- 
istic but  complicated  adjustment  models.  Perhaps  most  prominent  among  the  latter,  many  microeconomic 
variables  exhibit  only  infrequent  adjustment  to  their  optimal  level  (possibly  due  to  the  presence  of  fixed 
costs  of  adjustments).  And  the  same  is  true  of  policy  variables,  such  as  the  federal  funds  rate  set  by  the 
monetary  authority  in  response  to  changes  in  aggregate  conditions.  In  what  follows,  we  inquire  how  good 
the  estimates  of  the  speed  of  adjustment  from  the  standard  partial  adjustment  approximation  (2)  are,  when 
actual  adjustment  is  discrete. 


2.1  A  Simple  Lumpy  Adjustment  Model 

Lety,  denote  the  variable  of  concern  at  time  t  — e.g.,  the  federal  funds  rate,  a  price,  employment,  or  capital — 
and  y*  be  its  optimal  counterpart.  We  can  characterize  the  behavior  of  an  individual  agent  in  terms  of  the 
equation: 

Av,  =S,(tf-y,-i),  (3) 

where  £,  satisfies: 

Pr{L=l)     =     X. 

(4) 
Pr{^,=0}     =     1-X. 

From  a  modelling  perspective,  discrete  adjustment  entails  two  basic  features:  (i)  periods  of  inaction  fol- 
lowed by  abrupt  adjustments  to  accumulated  imbalances,  and  (ii)  increased  likelihood  of  an  adjustment  with 
the  size  of  the  imbalance  (state  dependence).  While  the  second  feature  is  central  for  the  macroeconomic  im- 
plications of  state-dependent  models,  it  is  not  needed  for  the  point  we  wish  to  raise  in  this  paper.  Therefore, 
we  suppress  it.1 

It  follows  from  (4)  that  the  expected  value  of  £,,  is  X.  When  h,t  is  zero,  the  agent  experiences  inaction; 
when  its  value  is  one,  the  unit  adjusts  so  as  to  eliminate  the  accumulated  imbalance.  We  assume  that  h,,  is 
independent  of  (y*  -y,-i)  (this  is  the  simplification  that  Calvo  (1983)  makes  vis-a-vis  more  realistic  state 
dependent  models)  and  therefore  have: 

E[Ay,  |>>;.y,-i]  =  My,*-*-,),  (5) 

which  is  the  analog  of  (1).  Hence  X  represents  the  adjustment  speed  parameter  to  be  recovered. 

2.2  The  Main  Result:  (Biased)  Instantaneous  Adjustment 

The  question  now  arises  as  to  whether,  by  analogy  to  the  derivation  from  (1)  to  (2),  the  standard  procedure 
of  estimating 

Ay,  =  (l-X)Ay,_,+£„  (6) 

recovers  the  average  adjustment  speed,  X,  when  adjustment  is  lumpy.  The  next  proposition  states  that  the 
answer  to  this  question  is  clearly  no. 

Proposition  1   (Instantaneous  Estimate) 

Let  X  denote  the  OLS  estimator  ofX  in  equation  (6).  Let  the  Ay*  's  be  i.i.d.  with  mean  0  and  variance  G  , 


'The  special  model  we  consider  — i.e.,  without  feature  (ii) —  is  due  to  Calvo  (1983)  and  was  extended  by  Rotemberg  (1987) 
to  show  that,  with  a  continuum  of  agents,  aggregate  dynamics  are  indistinguishable  from  those  of  a  representative  agent  facing 
quadratic  adjustment  costs.  One  of  our  contributions  is  to  go  over  the  aggregation  steps  in  more  detail,  and  show  the  problems  that 
arise  before  convergence  is  achieved. 


and  let  T  denote  the  time  series  length.  Then,  regardless  of  the  value  ofX: 

plimy-^A.  =  1 .  (7) 

Proof  See  Appendix  B.l.     | 

While  the  formal  proof  can  be  found  in  the  appendix,  it  is  instructive  to  develop  its  intuition  in  the  main 
text  If  adjustment  were  smooth  instead  of  lumpy,  we  would  have  the  classical  partial  adjustment  model,  so 
that  the  first  order  autocorrelation  of  observed  adjustments  is  1  —  A,  thereby  revealing  the  speed  with  which 
units  adjust.  But  when  adjustment  is  lumpy,  the  correlation  between  this  period's  and  the  previous  period's 
adjustment  necessarily  is  zero,  so  that  the  implied  speed  of  adjustment  is  one,  independent  of  the  true  value 
of  A.  To  see  why  this  is  so,  consider  the  covariance  of  Ay,  and  Ay,_j,  noting  that,  because  adjustment  is 
complete  whenever  it  occurs,  we  may  re-write  (3)  as: 


Ay,  =  &  £  Ay,*_,  =    {  (8) 


Ay?_*  =  \ 

0  otherwise 

where  /,  denotes  the  number  of  periods  since  the  last  adjustment  took  place,  (as  of  period  /) 

Table  1:  CONSTRUCTING  THE  MAIN  COVARIANCE 


Adjust  in  t  —  ) 

t      Adjust  in  t 

Ay,_i 

Ay, 

Contribution  to  Cov(Ay, ,  Ay,- 1 ) 

No 

No 

0 

0 

Ay,Ay,_,  =0 

No 

Yes 

0 

Ay,* 

Ay,Ay,_,  =0 

Yes 

No 

5t'oAy;_, 

I-* 

0 

Ay,Ay,_i  =0 

Yes 

Yes 

sjbiAtfi, 

I-* 

Ay; 

Cov(Ay,_,,Ay,)=0 

There  are  four  scenarios  to  consider  when  constructing  the  key  covariance  (see  Table  1):  If  there  was 
no  adjustment  in  this  and/or  the  last  period  (three  scenarios),  then  the  product  of  this  and  last  period's 
adjustment  is  zero,  since  at  least  one  of  the  adjustments  is  zero.  This  leaves  the  case  of  adjustments  in  both 
periods  as  the  only  possible  source  of  non-zero  correlation  between  consecutive  adjustments.  Conditional 
on  having  adjusted  both  in  /  and  /  —  1 ,  we  have 

Cov(Ay, ,  Ay,_,  |  £,=£,_,  =  1)  =  Cov(Ay,*  ,  Ay,*_,  +  Ay,*_2  +  •  •  ■  +  Ay?.^ )  =  0, 

since  adjustments  in  this  and  the  previous  period  involve  shocks  occurring  during  disjoint  time  intervals.  Ev- 
ery time  the  unit  adjusts,  it  catches  up  with  all  previous  shocks  it  had  not  adjusted  to  and  starts  accumulating 
shocks  anew.  Thus,  adjustments  at  different  moments  in  time  are  uncorrelated. 


-So  that  /,  =  1  if  the  unit  adjusted  in  period  /  -  1,  2  if  it  did  not  adjust  in  r  -  1  and  adjusted  in  t  -  2,  and  so  on. 


2.3     Robust  Bias:  Infrequent  and  Gradual  Adjustment 

Suppose  now  that  in  addition  to  the  infrequent  adjustment  pattern  described  above,  once  adjustment  takes 
place,  it  is  only  gradual.  Such  behavior  is  observed,  for  example,  when  there  is  a  time-to-build  feature  in 
investment  (e.g.,  Majd  and  Pindyck  (1987))  or  when  policy  is  designed  to  exhibit  inertia  (e.g..  Goodfriend 
(1987),  Sack  (1998),  or  Woodford  (1999)).  Our  main  result  here  is  that  the  econometrician  estimating  a 
linear  ARMA  process  — a  Calvo  model  with  additional  serial  correlation —  will  only  be  able  to  extract  the 
gradual  adjustment  component  but  not  the  source  of  sluggishness  from  the  infrequent  adjustment  compo- 
nent. That  is,  again,  the  estimated  speed  of  adjustment  will  be  too  fast,  for  exactly  the  same  reason  as  in  the 
simpler  model. 

Let  us  modify  our  basic  model  so  that  equation  (3)  now  applies  for  a  new  variable  y,  in  place  of  y,, 
with  Ay,  representing  the  desired  adjustment  of  the  variable  that  concerns  us,  Ay,.  This  adjustment  takes 
place  only  gradually,  for  example,  because  of  a  time-to-build  component.  We  capture  this  pattern  with  the 
process: 

K  K 

Ay,  =  X  <bkAy,-k  +  (!-]£  <h)Ay, .  (9) 

k=]  k=\ 

Now  there  are  two  sources  of  sluggishness  in  the  transmission  of  shocks.  Ay*,  to  the  observed  variable.  Ay,. 
First,  the  agent  only  acts  intermittently,  accumulating  shocks  in  periods  with  no  adjustment.  Second,  when 
he  adjusts,  he  does  so  only  gradually. 

By  analogy  with  the  simpler  model,  suppose  the  econometrician  approximates  the  lumpy  component  of 
the  more  general  model  by: 

Ay,  =  (l-k)Ay,-i+v,.  (10) 

Replacing  (10)  into  (9),  yields  the  following  linear  equation  in  terms  of  the  observable.  Ay,: 

K+\ 

Ay,  =  £a*Ay,_A. +  E,,  (11) 

*=i 

with 

a\         =    <j)i  +  l-A., 

ak         =    to-O-XJfc-i,         k  =  2,...,K,  (12) 

«A'+1      =      -(l-A.)<|>Ki 

ande,  =  Ml-Z*Lj<MArf. 

By  analogy  to  the  simpler  model,  we  now  show  that  the  econometrician  will  miss  the  source  of  persis- 
tence stemming  from  X. 

Proposition  2  (Omitted  Source  of  Sluggishness) 

Let  all  the  assumptions  in  Proposition  1  hold,  with  y  in  the  role  of  y.   Also  assume  that  (9)  applies. 

with  all  roots  of  the  polynomial  1  -  Xf=  i  <|)it^*  outside  the  unit  disk.  Let  a^.k  =  1 K+\  denote  the  OLS 

estimates  of  equation  (77). 


Then: 

plim-r^Ok  fa,         k  =  l,...,K 


(13) 
phmr^<x>dK+i     =    0. 


Proof  See  Appendix  B.l. 


Comparing  (12)  and  (13)  we  see  that  the  proposition  simply  reflects  the  fact  that  the  (implicit)  estimate 
of  X  is  one. 

The  mapping  from  the  biased  estimates  of  the  a^s  to  the  speed  of  adjustment  is  slightly  more  cumber- 
some, but  the  conclusion  is  similar.  To  see  this,  let  us  define  the  following  index  of  expected  response  time 
to  capture  the  overall  sluggishness  in  the  response  of  Ay  to  Ay*: 


*>0 


dAy, 


+k 


dAy* 


(14) 


where  E,[J  denotes  expectations  conditional  on  information  (that  is,  values  of  Ay  and  Ay*)  known  at  time 
/.  This  index  is  a  weighted  sum  of  the  components  of  the  impulse  response  function,  with  weights  equal  to 
the  number  of  periods  that  elapse  until  the  corresponding  response  is  observed.3  For  example,  an  impulse 
response  with  the  bulk  of  its  mass  at  low  lags  has  a  small  value  of  x,  since  Ay  responds  relatively  fast  to 
shocks. 

It  is  easy  to  show  (see  Propositions  Al  and  A2  in  the  Appendix)  that  both  for  the  standard  Partial 
Adjustment  Model  (1)  and  for  the  simple  lumpy  adjustment  model  (3)  we  have 

l-X 

More  generally,  for  the  model  with  both  gradual  and  lumpy  adjustment  described  in  (9),  the  expected 
response  to  Ay  satisfies: 

T       iLiWk 

while  the  expected  response  to  shocks  Ay*  is  equal  to:4 

1-*    ,      lk=iWk 


*       i-ZLi** 


(15) 


3Note  that,  for  the  models  at  hand,  the  impulse  response  is  always  non-negative  and  adds  up  to  one.  When  the  impulse  response 
does  not  add  up  to  one,  the  definition  above  needs  to  be  modified  to: 

t_S*>o*E,[^] 
4For  the  derivation  of  both  expressions  for  t  see  Propositions  Al  and  A3  in  the  Appendix. 


Let  us  label: 

1-A 

tlum  =       y      ■ 

It  follows  that  that  the  expected  response  when  both  sources  of  sluggishness  are  present  is  the  sum  of  the 
responses  to  each  one  taken  separately. 

We  can  now  state  the  implication  of  Proposition  2  for  the  estimated  expected  time  of  adjustment,  t. 

Corollary  1  (Fast  Adjustment)  Let  the  assumptions  of  Proposition  2  hold  and  let  x  denote  the  (classical) 
method  of  moments  estimator  for  x  obtained  from  OLS  estimators  of 

Ay,  =  ^  akAy,_k  +  e,. 
k=\ 

Then: 

plimr_„x  =  Xii„  <  x  =  xi,n  +  xlum.  (16) 

with  a  strict  inequality  for  X  <  1. 

Proof  See  Appendix  B.l.     | 

To  summarize,  the  linear  approximation  for  Ay  (wrongly)  suggests  no  sluggishness  whatsoever,  so  that 
when  this  approximation  is  plugged  into  the  (correct)  linear  relation  between  Ay  and  Ay,  one  source  of 
sluggishness  is  lost.  This  leads  to  an  expected  response  time  that  completely  ignores  the  sluggishness 
caused  by  the  lumpy  component  of  adjustments. 

2.4     An  Application:  Monetary  Policy 

Figure  1  depicts  the  monthly  evolution  of  the  intended  federal  funds  rate  during  the  Greenspan  era.5  The 
infrequent  nature  of  adjustment  of  this  policy  variable  is  evident  in  the  figure.  It  is  also  well  known  that 
monetary  policy  interventions  often  come  in  gradual  steps  (see,  e.g..  Sack  (1998)  and  Woodford  (1999)). 
fitting  the  description  of  the  model  we  just  characterized. 

Our  goal  is  to  estimate  both  components  of  x,  X|in  and  Xium.  Regarding  the  former,  we  estimate  AR 
processes  for  Ay  with  an  increasing  number  of  lags,  until  finding  no  significant  improvement  in  thegoodness- 
of-fit.  This  procedure  is  warranted  since  Ay,  the  omitted  regressor,  is  orthogonal  to  the  lagged  Ay's.  We 
obtained  an  AR(3)  process,  with  T|in  estimated  as  2.35  months. 

If  the  lumpy  component  is  relevant,  the  (absolute)  magnitude  of  adjustments  of  Ay  should  increase  with 
the  number  of  periods  since  the  last  adjustment.  The  longer  the  inaction  period,  the  larger  the  number  of 
shocks  in  Ay*  to  which  y  has  not  adjusted,  and  hence  the  larger  the  variance  of  observed  adjustments. 

To  test  this  implication  of  lumpy  adjustment,  we  identified  periods  with  adjustment  in  y  as  those  where 
the  residual  from  the  linear  model  takes  (absolute)  values  above  a  certain  threshold,  M.  Next  we  partitioned 


5The  findings  reported  in  this  section  remain  valid  if  we  use  different  sample  periods. 


Figure  1: 

Monthly  Intended  (Target)  Federal  Funds  Rate-  9/1987  -  3/2002 


those  observations  where  adjustment  took  place  into  two  groups.  The  first  group  included  observations 
where  adjustment  also  took  place  in  the  preceding  period,  so  that  the  estimated  Ay  only  reflects  innovations 
of  Ay*  in  one  period.  The  second  group  considered  the  remaining  observations,  where  adjustments  took 
place  after  at  least  one  period  with  no  adjustment. 

Columns  2  and  3  in  Table  2  show  the  variances  of  adjustments  in  the  first  and  second  group  described 
above,  respectively,  for  various  values  of  M.  Interestingly,  the  variance  when  adjustments  reflects  only 
one  shock  is  significantly  smaller  than  the  variance  of  adjustments  to  more  than  one  shock  (see  column  4). 
If  there  were  no  lumpy  component  at  all  (k  =  1),  there  would  be  no  systematic  difference  between  both 
variances,  since  they  would  correspond  to  a  random  partition  of  observations  where  the  residual  is  larger 
than  M.  We  therefore  interpret  our  findings  as  evidence  in  favor  of  significant  lumpy  adjustment.6 

To  actually  estimate  the  contribution  of  the  lumpy  component  to  overall  sluggishness,  we  need  to  esti- 
mate the  fraction  of  months  where  an  adjustment  in  y  took  place.  Since  we  only  observe  y,  we  require  some 
additional  information  to  determine  this  adjustment  rate.  If  we  had  a  criterion  to  choose  the  threshold  M, 
this  could  be  readily  done.  The  fact  that  the  Fed  changes  rates  by  multiples  of  0.25  suggests  that  reasonable 
choices  for  M  are  in  the  neighborhood  of  this  value.  Columns  5  and  6  in  Table  2  report  the  values  estimated 
for  X  and  t/„m  for  different  values  of  M.  For  all  these  cases,  the  estimated  lumpiness  is  substantial. 

An  alternative  procedure  is  to  extract  lumpiness  from  the  behavior  of  y,  directly.  For  this  approach,  we 
used  four  criteria:  First,  if  y,  -£  yt-\  and  y,_]  —  y,_2  then  an  adjustment  of  y  occurred  at  /.  Similarly  if  a 
"reversal"  happened  at  t,  that  is,  if  y,  >  y,_i  and  y,_i  <  y,_2  (ory,  <  y(_i  and  y,_i  >  y,_2).  By  contrast,  if 
y,  =  >v_i,  we  assume  that  no  adjustment  took  place  in  period  /.  Finally,  if  an  "acceleration"  took  place  at 


Also  note  that  (he  first  order  autocorrelation  of  residuals  is  -0.002,  so  that  our  finding  cannot  be  attributed  to  this  factor  either. 


Table  2:  ESTIMATING  THE  LUMPY  COMPONENT 


(1)  (2)  (3)  (4)  (5)         (6) 

M        Var(Ay,|^  =  ll^_i  =  l)     Var(Ay,|^  =  1,^_,  =0)     p-value         A         xlum 


0.150 

0.089 

0.175 

0.092 

0.200 

0.117 

0.225 

0.145 

0.250 

0.150 

0.275 

0.164 

0.300 

0.207 

0.325 

0.207 

0.350 

0.224 

0.213 

0.032 

0.365 

1.74 

0.221 

0.018 

0.323 

2.10 

0.230 

0.051 

0.253 

2.95 

0.267 

0.060 

0.206 

3.85 

0.325 

0.034 

0.165 

5.06 

0.378 

0.016 

0.135 

6.41 

0.378 

0.045 

0.123 

7.13 

0.378 

0.045 

0.123 

7.13 

0.433 

0.041 

0.100 

9.00 

For  various  values  of  M  (see  Column  1),  values  reported  in  the  remaining  columns  are  as  follows.  Columns 
2  and  3:  Estimates  of  the  variance  of  Ay,  conditional  on  adjusting,  for  observations  where  adjustment  also  took 
place  the  preceding  period  (column  2)  and  with  no  adjustment  in  the  previous  period  (column  3).  Column  4: 
p-value,  obtained  via  bootstrap,  for  both  variances  being  the  same,  against  the  alternative  that  the  latter  is  larger. 
Column  5:  Estimates  of  X.  Column  6:  Estimates  of  Tjum. 


t,  so  that  yt  —  y(_i  >  y,_i  —  »_?  >  0  (ory,  —  >V-i  <  y,-\  —  yt-i  <  0),  we  assume  that  y  adjusted  at  r.  With 
these  criteria  we  can  sort  156  out  of  the  174  months  in  our  sample  into  (lumpy)  adjustment  taking  place  or 
not.  This  allows  us  to  bound  the  (estimated)  value  of  A.  between  the  estimate  we  obtain  by  assuming  that  no 
adjustment  took  place  in  the  remaining  18  periods  and  that  in  which  all  of  them  correspond  to  adjustments 
for  y. 

Table  3:  MODELS  FOR  THE  INTENDED  FEDERAL  FUNDS  RATE 
Time-to-build  component  Lumpy  component 

Tl t2 $3 T-lin Kmn ''•max  ^lum.min       ^lum.max 

0.230        0.080        0.231        2.35  0.221        0.320         2.13  3.53 

(0.074)     (0.076)     (0.074)     (0.95)  (0.032)     (0.035)      (0.34)        (0.64) 

Reported:  Estimation  of  both  components  of  the  sluggishness  index  x.  Data:  Intended  (Target) 
Federal  Funds  Rate,  monthly,  9/1987-3/2002.  Standard  deviations  in  parenthesis,  obtained  via  Delta 
method. 


Table  3  summarizes  our  estimates  of  the  expected  response  time  obtained  with  this  procedure.  A  re- 
searcher who  ignores  the  lumpy  nature  of  adjustments  only  would  consider  the  AR-component  and  would 
infer  a  value  of  x  equal  to  2.35  months.  Yet  once  we  consider  infrequent  adjustments,  the  correct  estimate  of 
x  is  somewhere  between  4.48  and  5.88  months.  That  is,  ignoring  lumpiness  (wrongly)  suggests  a  response 
to  shocks  that  is  approximately  twice  as  fast  as  the  true  response.7 

Consistent  with  our  theoretical  results,  the  bias  in  the  estimated  speed  of  adjustment  stems  from  the 


7Note  that  these  coincide  with  estimates  obtained  for  M  in  the  0.175  to  0.225  range,  see  Table  2. 


infrequent  adjustment  of  monetary  policy  to  news.  As  shown  above,  this  bias  is  important,  since  infrequent 
adjustment  accounts  for  at  least  half  of  the  sluggishness  in  modern  U.S.  monetary  policy. 

3     Slow  Aggregate  Convergence 

Could  aggregation  solve  the  problem  for  those  variables  where  lumpiness  occurs  at  the  microeconomic 
level?  In  the  limit,  yes.  Rotemberg  (1987)  showed  that  the  aggregate  equation  resulting  from  individual 
actions  driven  by  the  Calvo-model  indeed  converges  to  the  partial-adjustment  model.  That  is,  as  the  number 
of  microeconomic  units  goes  to  infinity,  estimation  of  equation  (6)  for  the  aggregate  does  yield  the  correct 
estimate  of  X,  and  therefore  x.  (Henceforth  we  return  to  the  simple  model  without  time-to-build). 

But  not  all  is  good  news.  In  this  section,  we  show  that  when  the  speed  of  adjustment  is  already  slow,  the 
bias  vanishes  very  slowly  as  the  number  of  units  in  the  aggregate  increases.  In  fact,  in  the  case  of  investment 
even  aggregating  across  all  U.S.  manufacturing  establishments  is  not  sufficient  to  eliminate  the  bias. 

3.1     The  Result 

Let  us  define  the  N-aggregate  change  at  time  t,  Ayf,  as: 


^=ii><v, 


where  Ay,,  denotes  the  change  in  the  variable  of  interest  by  unit  i  in  period  t. 
Technical  Assumptions  (Shocks) 

Let  Ay* t  =  v^  +  vf,,  where  the  absence  of  a  subindex  i  denotes  an  element  common  to  all  i  (i.e,  that 
remains  after  averaging  across  all  f's).  We  assume: 

1 .  the  v^'s  are  i.i.d.  with  mean  /^  and  variance  o2A  >  0, 

2.  the  vj,'s  are  independent  (across  units,  over  time,  and  with  respect  to  the  v^'s),  identically  distributed 
with  zero  mean  and  variance  oj  >  0,  and 

3.  the  2^/s  are  independent  (across  units,  over  time,  and  with  respect  to  the  v^'s  and  v^'s),  identically 
distributed  Bernoulli  random  variables  with  probability  of  success  X  G  (0, 1].     | 

As  in  the  single  unit  case,  we  now  ask  whether  estimating 

6tf  =  (l-X)Arf_1+el,  (17) 

yields  a  consistent  (as  T  goes  to  infinity)  estimate  of  X,  when  the  true  microeconomic  model  is  (8).  The 
following  proposition  answers  this  question  by  providing  an  explicit  expression  for  the  bias  as  a  function  of 


10 


the  parameters  characterizing  adjustment  probabilities  and  shocks  (X.  fi^,  Ga  and  G/)  and,  most  importantly, 
N. 

Proposition  3  (Aggregate  Bias) 

Let  XN  denote  the  OLS  estimator  ofX  in  equation  (17).  Let  T  denote  the  time  series  length.  Then,  under 
the  Technical  Assumptions. 

plim^J."  -  X  +  i=-,  (18) 

1  +  A 


ith 


It  follows  that: 


K  =    2^V      .    \\   ,    ■  (19) 


lim  plimr_>00Aw  =  X.  (20) 

N— •<» 


Proof  See  Theorem  Bl  in  the  Appendix.     I 

To  see  the  source  of  the  bias  and  why  aggregation  reduces  it,  we  begin  by  writing  the  first  order  auto- 
correlation, pi ,  as  an  expression  that  involves  sums  and  quotients  of  four  different  terms: 

^  Cov(A3f  ,Ay*  ,)  =  [/VCov(Ayi.,.Ayi.,-i)  +  N{N  -  l)Cov(Ayu,Ay2,,_1)]//V2 
Pl  Var(Ay?)  [A/Var(Ay„)  +  /V(N  -  l)Cov(Ay,.f,Ay2.,)]/iV2 

where  the  subindex  1  and  2  in  Ay  denote  two  different  units. 

The  numerator  of  (21)  includes  A7  (by  symmetry  identical)  first-order  autocovariance  terms,  one  for  each 
unit,  and  N(N  —  1)  (also  identical)  first-order  cross-covariance  terms,  one  for  each  pair  of  different  units. 
Likewise,  the  denominator  considers  N  identical  variance  terms  and  N (N  ~  1)  identical  contemporaneous 
cross-covariance  terms. 

From  columns  2  and  4  in  Table  4  we  observe  that  the  cross-covariance  terms  under  PAM  and  lumpy 
adjustment  are  the  same.8  Since  these  terms  will  dominate  for  sufficiently  large  N  — there  are  N(N  —  1)  of 
them,  compared  to  N  additional  terms —  it  follows  that  the  bias  vanishes  as  N  goes  to  infinity. 


8This  is  somewhat  remarkable,  since  the  underlying  processes  are  quite  different.  For  example,  consider  the  first-order  cross- 
covariance  term.  In  the  case  of  PAM,  adjustments  at  all  lags  contribute  to  the  cross-covariance  term: 

Cov(Ay,,,,Ay2,,_1)      =     Cov(X  £(1  -X)kA/U-k  .  *X0  -^'^-l-/) 

*>o  ;>o 

=       X   X2(l-X)w+,Cov(AyIi,_Jt,Aj5il_1_l) 


5>:u-*) 


2/+ I 


By  contrast,  in  the  case  of  the  lumpy  adjustment  model,  the  non-zero  terms  obtained  when  calculating  the  covanance  between  Ayj., 
and  Ayij- 1  are  due  to  aggregate  shocks  included  both  in  the  adjustment  of  unit  1  (in  ;)  and  unit  2  (in  I  —  1).  Idiosyncratic  shocks 


II 


Table  4:  Constructing  the  First  Order  Correlation 


(1) 

(2) 

(3) 

(4) 

Cov(Ayiir,Ayiif_i) 

Cov(Ayu,Ay2.t-i) 

Var(Ay,,,) 

Cov(AyM,Ay2i,) 

(1) 

PAM: 

^(l-XKoJ  +  of) 

&(i-*)<>3 

nK+°;) 

2-X°A 

(2) 

Lumpy  (fiA  =  0): 

0 

&(l-*)°5 

°5  +  G/ 

2-X°A 

(3) 

Lumpy  ((iA7^0): 

-(1-X>2 

&0-*K 

°iW  +  ^ 

tf 

2-X°A 

However,  the  underlying  bias  may  remain  significant  for  relatively  large  values  of  A7.  From  Table  4  it 
follows  that  the  bias  for  the  estimated  first-order  autocorrelation  originates  from  the  autocovariance  terms 
included  in  both  the  numerator  and  denominator.  The  first-order  autocovariance  term  in  the  numerator 
is  zero  for  the  lumpy  adjustment  model,  while  it  is  positive  under  PAM  (this  is  the  bias  we  discussed  in 
Section  2).  And  even  though  the  number  of  terms  with  this  bias  is  only  N,  compared  with  N(N  —  1)  cross- 
covanance  terms  with  no  bias,  the  missing  terms  are  proportional  to  GA  +  rj2,  while  those  that  are  included 
are  proportional  to  g2a,  which  is  considerably  smaller  in  all  applications.  This  suggests  that  the  bias  remains 
significant  for  relatively  large  values  of  N  (more  on  this  below)  and  that  this  bias  rises  with  the  relative 
importance  of  idiosyncratic  shocks. 

There  is  a  second  source  of  bias  once  N  >  1,  related  to  the  variance  term  Var(Ayi ,)  in  the  denominator 
of  the  first-order  correlation  in  (21).  While  under  PAM  this  variance  is  increasing  in  A,  varying  between  0 
(when  A  —  0)  and  GA  +  a2  (when  A  =  1),  when  adjustment  is  lumpy  this  variance  attains  the  largest  possible 
value  under  PAM,  GA  +  o~2,  independent  of  the  underlying  adjustment  speed  A.  This  suggests  that  the  bias  is 
more  important  when  adjustment  is  fairly  infrequent.9 

Substituting  the  terms  in  the  numerator  and  denominator  of  (21)  by  the  expressions  in  the  second  row  of 


are  irrelevant  as  far  as  the  covariance  is  concerned.  It  follows  that: 

Cov[Ayi,,  ,Ay2,,-i|E,i,,  =  1,  £2,1-1  =  UuA^i]  =  min(lh,  -  l,/2,,-i)a^, 
and  averaging  over  /] ,  and  /2,1-L  both  of  which  follow  (independent)  Poisson  processes,  we  obtain: 

Cov(Ayi,„Ay2,,_1)=  ^(l-X)^, 


(22) 


which  is  the  expression  obtained  under  PAM. 

9To  further  understand  why  Var(A>'i,)  can  be  so  much  larger  in  a  Calvo  model  than  with  partial  adjustment,  we  compare  the 
contribution  to  this  variance  of  shocks  that  took  place  k  periods  ago,  V^,. 
For  PAM  we  have 

Var(Ay„)  =  Var(  £  X(l  -  X)*AyJ  f_k)  =  X2  £  (1  -  X)aVar(AyJ ,_,),  (23) 

k>0  k>0 
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Table  4,  and  dividing  numerator  and  denominator  by  N(N  -  1  )X/(2  —  X)  leads  to: 

1-X 

Pl  = n-? 3fv  (24) 

This  expression  confirms  our  discussion.  It  illustrates  clearly  that  the  bias  is  increasing  in  G//ga  and  de- 
creasing in  A.  and  N. 

Finally,  we  note  that  a  value  of  /x^  7^  0  biases  the  estimates  of  the  speed  of  adjustment  even  further. 
The  reason  for  this  is  that  it  introduces  a  sort  of  "spurious"  negative  correlation  in  the  time  series  of  Ayi,. 
Whenever  the  unit  does  not  adjust,  its  change  is,  in  absolute  value,  below  the  mean  change.  When  adjustment 
finally  takes  place,  pent-up  adjustments  are  undone  and  the  absolute  change,  on  average,  exceeds  \ha\.  The 
product  of  these  two  terms  is  clearly  negative,  inducing  negative  serial  correlation.10 

Summing  up,  the  bias  obtained  when  estimating  X  with  standard  partial  adjustment  regressions  can  be 
expected  to  be  significant  when  either  ct^/g/,  N  or  X  is  small,  or  |it.i|  is  large.  Figure  2  illustrates  how  XN 
converges  to  X.  The  baseline  parameters  (solid  line)  are  11,4  =  0,  A.  =  0.20.  Ga  =  0.03  and  G/  =  0.24.  The 
solid  line  depicts  the  percentage  bias  as  N  grows.  For  N  =  1,000,  the  bias  is  above  100%;  by  the  time 
N  =  10.000,  it  is  slightly  above  20%.  The  dash-dot  line  increases  Oa  to  0.04.  speeding  up  convergence. 
By  contrast,  the  dash-dot  line  considers  jxa  =  0.10,  which  slows  down  convergence.  Finally,  the  dotted  line 
shows  the  case  where  X  doubles  to  0.40.  which  also  speeds  up  convergence. 

Corollary  2  (Slow  Convergence) 

The  bias  in  the  estimator  of  the  adjustment  speed  is  increasing  in  G/  and  I/I4 1  and  decreasing  in  Oa  •  N  and 
X.  Furthermore,  the  four  parameters  mentioned  above  determine  the  bias  of  the  estimator  via  a  decreasing 
expression  of  K. ' ' 

Proof  Trivial.     I 


so  that 

^PAM=X2(1_x)2*(02+o2) 

By  contrast,  in  the  case  of  lumpy  adjustment  we  have 

viunw     =     Fr{/i,,=Jk}VSff(Ay,il|/,/=t) 

Pr{/,.,  =A-}[AVar(Ay,,|/„  =  1,1;,.,  =  1)  +  (I  -X-)Var(Ay,,,|f,,r  =  0,&  =0)] 
=     \(\-\)k-l[lk{<?A+<$)}. 

Vkt  is  much  larger  under  the  lumpy  adjustment  model  than  under  PAM.  With  infrequent  adjustment  the  relevant  conditional  distri- 
bution is  a  mixture  of  a  mass  at  zero  (corresponding  to  no  adjustment  at  all)  and  a  normal  distribution  with  a  variance  that  grows 
linearly  with  k  (corresponding  to  adjustment  in  t ).  Under  PAM.  by  contrast,  V; ,  is  generated  from  a  a  normal  distribution  with  zero 
mean  and  variance  that  decreases  with  k. 

We  also  have  that  /i,*  ^  0  further  increases  the  bias  due  to  the  variance  term,  see  entry  (3,3)  in  Table  4. 
"The  results  for  a/  and  k  may  not  hold  if  \nA\  is  large.  For  the  results  to  hold  we  need  N  >  1  +  (2  -  X)ti2A/\a2A.  When  Ma  =  u 
this  is  equivalent  to  A'  >  1  and  therefore  is  not  binding. 
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Figure  2: 

Bias  as  a  function  of  N  for  various  parameter  configurations 
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3.2     Applications 

Figure  2  shows  that  the  bias  in  the  estimate  of  the  speed  of  adjustment  is  likely  to  remain  significant,  even 
when  estimated  with  very  aggregated  data.  In  this  section  we  provide  concrete  examples  based  on  estimates 
for  U.S.  employment,  investment,  and  price  dynamics.  These  series  are  interesting  because  there  is  extensive 
evidence  of  their  infrequent  adjustment  at  the  microeconorruc  level. 

Let  us  start  with  U.S.  manufacturing  employment.  We  use  the  parameters  estimated  by  Caballero,  Engel, 
and  Haiti  wanger(  1997)  with  quarterly  Longitudional  Research  Datafile  (LRD)  data.  Table  5  shows  that  even 
when  N  —  1,000,  which  corresponds  to  many  more  establishments  than  in  a  typical  two-digit  sector  of  the 
LRD.  the  bias  remains  above  40  percent.  That  is,  estimated  speeds  of  adjustment  at  the  sectoral  level  are 
likely  to  be  significantly  faster  than  the  true  speed  of  adjustment. 

The  good  news  in  this  case  is  that  for  N  =  10,000,  which  is  about  the  size  of  the  continuous  sample  in 
the  LRD,  the  bias  essentially  vanishes. 

The  results  for  prices,  reported  in  Table  6,  are  based  on  the  estimate  of  X,  Ha  and  04.  from  Bils  and 
Klenow  (2002),  while  a/  is  consistent  with  that  found  in  Caballero  et  al  (1997). ,2  The  table  shows  that  the 


Togo  from  the  a;  computed  for  employment  in  Caballero  et  al.  (1997)  to  that  of  prices,  we  note  that  if  the  demand  faced  by  a 
monopolistic  competitive  firm  is  isoelastic,  its  production  function  is  Cobb-Douglas,  and  its  capital  fixed  (which  is  nearly  correct 
at  high  frequency),  then  (up  to  a  constant): 

P*a  =  (wi  -flir)  +  0  -°-l)1'i 
where  p*  and  I*  denote  the  logarithms  of  frictionless  price  and  employment,  w,  and  a,,  are  the  logarithm  of  the  nominal  wage 
and  productivity,  and  a^  is  the  labor  share.  It  is  straightforward  to  see  that  as  long  as  the  main  source  of  idiosyncratic  variance  is 
demand,  which  we  assume,  a/ .  ~  (1  —  a/Jo-/,. . 
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Table  5:  Slow  Convergence:  Employment 
Average  X 

Number  of  agents  (N) 
100  1,000  10.000 


Number         35 
of 
Time  200 

Periods 
(T) 


0.901         0.631  0.523 

0.852         0.548  0.436 

0.844         0.532  0.417 


0.400 


Reported:  average  of  OLS  estimates  of  X,  obtained  via  simulations.  Number  of 
simulations  chosen  to  ensure  that  numbers  reported  have  a  standard  deviation  less 
than  0.002.  Case  T  =  °°  calculated  from  Proposition  3.  Simulation  parameters: 
X:  0.40,  jxa  =  0.005.  aA  =  0.03,  o/  =  0.25.  Quarterly  data,  from  Caballero  et  al. 
(1997). 


bias  remains  significant  even  for  N  =  10,000.  In  this  case,  the  main  reason  for  the  stubborn  bias  is  the  high 
value  of  0/ /oa ■ 

Table  6:  SLOW  CONVERGENCE:  PRICES 
Average  X 


100 


Number  of  agents  (N) 
1,000  10,000 


Number         60 

of 
Time  500 

Periods 

(7) 


0.935  (if)  14 
0.908  0.542 
0.902         0.533 


0.351 
0.279 
0.269         0.220 


Reported:  average  OLS  estimates  of  X,  obtained  via  simulations.  Number  of 
simulations  chosen  to  ensure  that  numbers  reported  have  a  standard  deviation  less 
than  0.002.  Case  T  =  «*>  calculated  from  Proposition  3.  Simulation  parameters: 
X:  0.22  (monthly  data,  Bils  and  Klenow,  2002),  nA  =  0.003,  aA  =  0.0054,  a,  = 
0.048. 

Finally,  Table  7  reports  the  estimates  for  equipment  investment,  the  most  sluggish  of  the  three  series. 
The  estimate  of  X,  fi^  ar)d  Oa<  are  from  Caballero,  Engel,  and  Haltiwanger  (1995),  and  rj/  is  consistent  with 
that  found  in  Caballero  et  al  (1997). 13  Here  the  bias  remains  very  large  and  significant  throughout.  Even 
when  N  —  10,000,  the  estimated  speed  of  adjustment  exceeds  the  actual  speed  by  more  than  80  percent.  The 


,3To  go  from  the  07  computed  for  employment  in  Caballero  et  al  (1997)  to  that  of  capital,  we  note  that  if  the  demand  faced  by  a 
monopolistic  competitive  firm  is  isoelastic  and  its  production  function  is  Cobb-Douglas,  then  07,.  ~  a/.. . 


15 


reasons  for  this  is  the  combination  of  a  low  A.,  a  high  \iA  (mostly  due  to  depreciation),  and  a  large  rj/  (relative 
to  aA). 

Table  7:  SLOW  CONVERGENCE:  INVESTMENT 
Average  A. 


Number  of 

agents  (N) 

15 

100 

1,000 

10,000 

OO 

1.060 

0.950 

0.665 

Number 

of 

50 

1.004 

0.790 

0.400 

Time 

Periods 

200 

0.985 

0.723 

0.305 

CO 

OO 

0.979 

0.696 

0.274 

0.150 

Reported:  average  of  OLS  estimates  of  X,  obtained  via  simulations.  Number 
of  simulations  chosen  to  ensure  that  numbers  reported  have  a  standard  deviation 
less  than  0.002.  Simulation  parameters:  X:  0.15  (annual  data,  from  Caballero  et 
al,  1995),  nA  =  0.12,  aA  =  0.056,  07  =  0.50. 

We  have  assumed  throughout  that  Ay*  is  i.i.d.  Aside  from  making  the  results  cleaner,  it  should  be 
apparent  from  the  time-to-build  extension  in  Section  2  that  adding  further  serial  correlation  does  not  change 
the  essence  of  our  results.  In  such  a  case,  the  cross  correlations  between  contiguous  adjustments  are  no 
longer  zero,  but  the  bias  we  have  described  remains.  In  any  event,  for  each  of  the  applications  in  this 
subsection,  there  is  evidence  that  the  i.i.d.  assumption  is  not  farfetched  (see,  e.g.,  Caballero  et  al  [1995, 
1997],  Bils  and  Klenow  [2002]). 

4     Fragile  Solutions:  Biased  Regressions  and  ARMA  Correction 

Can  we  fix  the  problem  while  remaining  within  the  class  of  linear  time-series  models?  In  this  section,  we 
show  that  in  principle  this  is  possible,  but  in  practice  it  is  unlikely  (especially  for  small  N). 

4.1     Biased  Regressions 

So  far  we  have  assumed  that  the  speed  of  adjustment  is  estimated  using  only  information  on  the  economic 
series  of  interest,  y.  Yet  often  the  econometrician  can  resort  to  a  proxy  for  the  target  y*.  Instead  of  (2),  the 
estimating  equation  is: 

Ay,  -  ( 1  -  X) Ay,- 1  +  AAy,*  +  e, ,  (25) 

with  some  proxy  available  for  the  regressor  Ay*. 
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Equation  (25)  hints  at  a  procedure  for  solving  the  problem.  Since  the  regressors  are  orthogonal,  A 
in  principle  can  be  estimated  directly  from  the  parameter  estimate  associated  with  Ay*,  while  dropping 
the  constraint  that  the  sum  of  the  coefficients  on  the  right  hand  side  add  up  to  one.  Of  course,  if  the 
econometrician  does  impose  the  latter  constraint,  then  the  estimate  of  A.  will  be  some  weighted  average  of 
an  unbiased  and  a  biased  coefficient,  and  hence  will  be  biased  as  well.  We  summarize  these  results  in  the 
following  proposition. 

Proposition.4  (Bias  with  Regressors) 

With  the  same  notation  and  assumptions  as  in  Proposition  3,  consider  the  following  equation: 

Ay?  =  fc0Ayf_,  +&iAy,*+e„  (26) 

where  Ay*  denotes  the  average  shock  in  period  t,  £Ay*;/7V.    Then,  if  (26)  is  estimated  via  OLS,  and  K 
defined  in  (19), 


(i)  without  any  restrictions  on  bo  and  h\: 


(ii)  imposing  bo  =  1  —b].u 


plinij^^b]  =  A  + 
In  particular,  for  N  =  1  and  /1.4  =  0: 


plimT_tJ0     =     -^-(1-A),  (27) 

1  +  K 

plimT_J>x     =     A,  (28) 

X(l-X) 


2{K  +  X)' 


i       1      !~X 
plimT^„b\  =  A  H — . 


Proof  See  Corollary  Bl  in  the  Appendix. 


Of  course,  in  practice  the  "solution"  above  is  not  very  useful.  First,  the  econometrician  seldom  observes 
Ay*  exactly,  and  (at  least)  the  scaling  parameters  need  to  be  estimated.  In  this  situation,  the  coefficient  es- 
timate on  the  contemporaneous  proxy  for  Ay*  is  no  longer  useful  for  estimating  A,  and  the  latter  must  be 
estimated  from  the  serial  correlation  of  the  regression,  bringing  back  the  bias.  Second,  when  the  econometri- 
cian does  observe  Ay*,  the  adding  up  constraint  typically  is  linked  to  homogeneity  and  long-run  conditions 


,4The  expression  that  follows  is  a  weighted  average  of  the  unbiased  estimalor  X  and  the  biased  estimator  in  the  regression  without 
Av*  as  a  regressor  (Proposition  3).  The  weight  on  the  biased  estimator  is  XK/2(K  +  X),  which  corresponds  to  the  harmonic  mean 
of  X  and  K. 
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that  a  researcher  often  will  be  reluctant  to  drop  (see  below). 

Fast  Micro  -  Slow  Macro?:  A  Price- Wage  Equation  Application 

In  an  intriguing  article,  Blanchard  (1987)  reached  the  conclusion  that  the  speed  of  adjustment  of  prices 
to  cost  changes  is  much  faster  at  the  disaggregate  than  the  aggregate  level.  More  specifically,  he  found  that 
prices  adjust  faster  to  wages  (and  input  prices)  at  the  two-digit  level  than  at  the  aggregate  level.  His  study 
considered  seven  manufacturing  sectors  and  estimated  equations  analogous  to  (26),  with  sectoral  prices  in 
the  role  of  y,  and  both  sector-specific  wages  and  input  prices  as  regressors  (the  y*).  The  classic  homogeneity 
condition  in  this  case,  which  was  imposed  in  Blanchard's  study,  is  equivalent  in  our  setting  to  bo  +  b\  —  1 . 

Blanchard's  preferred  explanation  for  his  finding  was  based  on  the  slow  transmission  of  price  changes 
through  the  input-output  chain.  This  is  an  appealing  interpretation  and  likely  to  explain  some  of  the  dif- 
ference in  speed  of  adjustment  at  different  levels  of  aggregation.  However,  one  wonders  how  much  of  the 
finding  can  be  explained  by  biases  like  those  described  in  this  paper.  We  do  not  attempt  a  formal  decompo- 
sition but  simply  highlight  the  potential  size  of  the  bias  in  price-wage  equations  for  realistic  parameters. 

Matching  Blanchard's  framework  to  our  setup,  we  know  that  his  estimated  sectoral  A  is  approximately 
0.18  while  at  the  aggregate  level  it  is  0.135.15 

Table  8:  Biased  Speed  of  Adjustment:  Price-Wage  Equations 

Average  A 

Number  of  firms  (A7) 
100  500  1.000  5.000         10,000  oo 


250  0.416       0.235         0.194  0.148  0.142        0.135 
No.  of  Time  Periods,  T: 
oo 0.405       0.239         0.194          0.148           0.142        0.135 

Reported:  For  T  =  250,  average  estimate  of  X  obtained  via  simulations.  Number  of  simulations  chosen  to  ensure  that 
estimates  reported  have  a  standard  deviation  less  than  0.004.  For  T  =  °°:  calculated  from  (28).  Simulation  parameters: 
X:  0.135  and  T  =  250  (from  Blanchard,  1987,  monthly  data),  nA  =  0.003,  aA  =  0.0054  and  07  =  0.048  as  in  Table  6. 


Table  8  reports  the  bias  obtained  when  estimating  the  adjustment  speed  from  sectoral  price-wage  equa- 
tions. It  assumes  that  the  true  speed  of  adjustment,  A,  is  0.135,  and  considers  various  values  for  the  number 
of  firms  in  the  sector.  The  table  shows  that  for  reasonable  values  of  N  there  is  a  significant  upward  bias  in 
the  estimated  value  of  A,  certainly  enough  to  include  Blanchard's  estimates.16 


,5We  obtained  these  estimates  by  matching  the  cumulative  impulse  responses  reported  in  the  first  two  columns  of  Table  8  in 
Blanchard  (1987)  for  5,  6  and  7  lags.  For  the  sectoral  speeds  we  obtain,  respectively,  0.167,  0.182  and  0.189,  while  for  the 
aggregate  speed  we  obtain  0.137.  0.121  and  0.146.  The  numbers  in  the  main  text  are  the  average  X's  obtained  this  way. 

16The  estimated  speed  of  adjustment  is  close  to  0.18  for  N  =  1,000. 
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4.2     ARMA  Corrections 

Let  us  go  back  to  the  case  of  unobserved  Ay*.  Can  we  fix  the  bias  while  remaining  within  the  class  of  linear 
ARMA  models?  In  the  first  part  of  this  subsection,  we  show  that  this  is  indeed  possible.  Essentially,  the 
correction  amounts  to  adding  a  nuisance  MA  term  that  "absorbs"  the  bias. 

However,  the  second  part  of  this  subsection  warns  that  this  nuisance  parameter  needs  to  be  ignored  when 
estimating  the  speed  of  adjustment.  This  is  not  encouraging,  because  in  practice  the  researcher  is  unlikely 
to  know  when  he  should  or  should  not  drop  some  of  the  MA  terms  before  simulating  (or  drawing  inferences 
from)  the  estimated  dynamic  model. 

On  the  constructive  side,  nonetheless,  we  show  that  when  N  is  sufficiently  large,  even  if  we  do  not  ignore 
the  nuisance  MA  parameter,  we  obtain  better  — although  still  biased —  estimates  of  the  speed  of  adjustment 
than  with  the  simple  partial  adjustment  model. 

4.2.1     Nuisance  Parameters  and  Bias  Correction 

Let  us  start  with  the  positive  result. 

Proposition  5  (Bias  Correction)  Let  the  Technical  Assumptions  (see  page  10)  hold}1  Then  Ayf  follows 
an  ARMA(l.l)  process  with  autoregressive  parameter  equal  to  1  —  X.  Thus,  adding  an  MA(1 )  term  to  the 
standard  partial  adjustment  equation  (2): 

Ayf  =  (l-X)Ayf/_1+v,-ey,_lj  (29) 

and  denoting  by  XN  any  consistent  estimator  of  one  minus  the  AR-coefficient  in  the  equation  above,  we  have 
that: 

plimT -00/v    =  X. 

The  moving  average  coefficient,  9.  is  a  "nuisance"  parameter  that  depends  on  N  (it  converges  to  zero  as  N 
tends  to  infinity),  fa.  Oa  and  °"/-  VM?  have  that: 

Q=~{L-VL2-4)>0, 

with 

2  +  X(2-X)(K-\) 


L 


\-X 
and  K  defined  in  (19). 

Proof  See  Theorem  Bl  in  the  Appendix.     I 

17Strictly  speaking,  to  avoid  the  case  where  the  AR  and  MA  coefficients  coincide,  we  need  to  rule  out  the  knife-edge  case 
N  -  1  =  (2  -  \)n^  /Xo^  .  In  particular,  when  ma  =  0  this  amounts  to  assuming  N  >  1 . 
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The  proposition  shows  that  adding  an  MA(1)  term  to  the  standard  partial  adjustment  equation  eliminates 
the  bias.  This  rather  surprising  result  is  valid  for  any  level  of  aggregation.  However,  in  practice  this  cor- 
rection is  not  robust  for  small  N,  as  the  MA  and  AR  coefficients  are  very  similar  in  this  case  (coincidental 
reduction).18  Also,  as  with  all  ARMA  estimation  procedures,  the  time  series  needs  to  be  sufficiently  long 
(typically  T  >  100)  to  avoid  a  significant  small  sample  bias. 

Table  9:  ADJUSTMENT  SPEED  X:  WITH  AND  WITHOUT  MA  CORRECTION 


Number  of  agents 

(N) 

100 

1,000 

10,000 

AR(1): 

0.18 

0.88 

1.40 

Employment 

ARMA(1,1): 

0.65 

1.29 

1.47 

ARMA(1,1), 

ignoring  MA: 

1.50 

1.50 

1.50 

AR(1): 

0.11 

0.88 

2.72 

Prices 

ARMA(1,1): 

1.27 

2.83 

3.43 

ARMA(1,1), 

ignoring  MA: 

3.55 

3.55 

3.55 

AR(1): 

0.02 

0.44 

2.65 

Investment 

ARMA(1,1): 

0.77 

3.92 

5.29 

ARMA(1,1), 

ignoring  MA: 

5.67 

5.67 

5.67 

Reported:  theoretical  value  of  T,  ignoring  small  sample  bias  (7"  =  °°).  "AR(1)"  and 
"ARMA(1,1)"  refer  to  values  of  x  obtained  from  AR(1)  and  ARMA(1,1)  representations. 
"ARMA(l.l)  ignoring  MA"  refers  to  estimate  obtained  using  ARMA(l.l)  representation,  but 
ignoring  the  MA  term.  The  results  in  Propositions  3  and  5  were  used  to  calculate  the  expres- 
sions for  x.  Parameter  values  are  those  reported  in  Tables  4,  5  and  6. 


Next  we  illustrate  the  extent  to  which  our  ARMA  correction  estimates  the  correct  response  time  in  the 
applications  to  employment,  prices,  and  investment  considered  in  Section  3.  We  begin  by  noting  that  the 
expected  response  time  inferred  without  dropping  the  MA  term  is:19 

1-X 9_       1-X_ 

Tma~~~T^e<-X~~x' 

where  0  >  0  was  defined  in  Proposition  5,  x  denotes  the  correct  expected  response  and  xma  the  expected 
response  that  is  inferred  from  a  non-parsimonious  ARMA  process  (it  could  be  an  MA(«),  an  AR(°°)  or,  in 
our  particular  case,  an  ARMA(  1,1)). 

The  third  row  in  each  of  the  applications  in  Table  9  illustrates  the  main  result.  The  estimate  of  x,  when 
the  nuisance  term  is  used  in  estimation  but  dropped  for  x-calculations,  is  unbiased  in  all  the  cases,  regardless 
of  the  value  of  N  (note  that  in  order  to  isolate  the  biases  that  concern  us  we  have  assumed  T  —  °°). 


18For  example,  if  jiA  =  0,  we  have  that  the  AR  and  MA  term  are  identical  for  N  =  I . 
19See  Proposition  Al  in  the  Appendix  for  a  derivation. 
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The  first  and  second  rows  (in  each  application)  show  the  biased  estimates.  The  former  repeats  our  basic 
result  while  the  latter  illustrates  the  problems  generated  by  not  dropping  the  nuisance  MA  term  (Xma).  The 
bias  from  not  dropping  the  MA  term  is  smaller  than  that  from  inferring  x  from  the  first  order  autocorrela- 
tion,20 yet  it  remains  significant  even  at  fairly  high  levels  of  aggregation  (e.g.,  N  =  1,000). 

5     Conclusion 

The  practice  of  approximating  dynamic  models  with  linear  ones  is  widespread  and  useful.  However,  it  can 
lead  to  significant  overestimates  of  the  speed  of  adjustment  of  sluggish  variables.  The  problem  is  most  severe 
when  dealing  with  data  at  low  levels  of  aggregation  or  single-policy  variables.  For  once,  macroeconomic 
data  seem  to  be  better  than  microeconomic  data. 

Yet  this  paper  also  shows  that  the  disappearance  of  the  bias  with  aggregation  can  be  extremely  slow. 
For  example,  in  the  case  of  investment,  the  bias  remains  above  80  percent  even  after  aggregating  across  all 
continuous  establishments  in  the  LRD  (N  =  10.000). 

While  the  researcher  may  think  that  at  the  aggregate  level  it  does  not  matter  much  which  microeconomic 
adjustment-cost  model  generates  the  data,  it  does  matter  greatly  for  (linear)  estimation  of  the  speed  of 
adjustment. 

What  happened  to  Wold's  representation,  according  to  which  any  stationary,  purely  non-deterministic, 
process  admits  an  (eventually  infinite)  MA  representation?  Why,  as  illustrated  by  the  analysis  at  the  end 
of  Section  4,  do  we  obtain  an  upward  biased  speed  of  adjustment  when  using  this  representation  for  the 
stochastic  process  at  hand?  The  problem  is  that  Wold's  representation  expresses  the  variable  of  interest  as 
a  distributed  lag  (and  therefore  linear  function)  of  innovations  that  are  the  one-step-ahead  linear  forecast 
errors.  When  the  relation  between  the  macroeconomic  variable  of  interest  and  shocks  is  non-linear,  as  is  the 
case  when  adjustment  is  lumpy.  Wold's  representation  misidentifies  the  underlying  shock,  leading  to  biased 
estimates  of  the  speed  of  adjustment. 

Put  somewhat  differently,  when  adjustment  is  lumpy,  Wold's  representation  identifies  the  correct  ex- 
pected response  time  to  the  wrong  shock.  Also,  and  for  the  same  reason,  the  impulse  response  more  gener- 
ally will  be  biased.  So  will  many  of  the  dynamic  systems  estimated  in  VAR  style  models,  and  the  structural 
tests  that  derive  from  such  systems.  We  are  currently  working  on  these  issues. 


-°This  can  be  proved  formally  based  on  the  expressions  derived  in  Theorem  BI  and  Proposition  Al  in  the  appendix. 
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d&y,+k 
[    de, 

lk>okh 

and         x  =  — -= 

i.       /md 

V'(l)   __  S*>i*V* 

Appendix 

A     The  Expected  Response  Time  Index:  x 

Lemma  Al  (t  for  an  Infinite  MA)  Consider  a  second  order  stationary  stochastic  process 

Ay,  =  ^Wk^t-k, 

k>0 

with  \\Iq  =  1,  £*>()¥*  <  °°'  ^le  £' '5  uncorrelated,  and  e,  itncorrelated  with  Ay,_i,Ay,_2. ...  Assw/Me  ?/i«r 
^M  —  Yk>o^VkZ   has  all  its  roots  outside  the  unit  disk. 
Define: 

Ik  =  I 

Then: 

Proof  That  Ik  =  ty*  is  trivial.  The  expressions  for  x  then  follow  from  differentiating  *¥(z)  and  evaluating  at 
z=l.     I 

Proposition  Al  (x  for  an  ARMA  Process)  Assume  Ay,  follows  an  ARMA(p.q): 

P  9 

Ay,  -  Yj  fa&yt-k  =  e,  -  X  Qk£,-k- 

k=l  k=\ 

where  $>{z)  =  1  —  Y,k=\  <J*JtS*  a,!^  ®(z)  —  1  ~~  Sf=i  6*z*  'iflve  «//  r/iefr  roorr  outside  the  unit  disk.  The  assump- 
tions regarding  the  E,  's  are  the  same  as  in  Lemma  A 1. 
Define  x  as  in  (30).  Then: 

Proof  Given  the  assumptions  we  have  made  about  the  roots  of  <J>(z)  and  0(z),  we  may  write. 

0(L) 

Ay< =  *(Z)E" 

where  L  denotes  the  lag  operator.  Applying  Lemma  Al  with  0(z)/4>(z)  in  the  role  of  *F(z)  we  then  have: 

'  ©(i)     *(i)  "  1-zjLi**     i-iLi6*' 

Proposition  A2  (x  for  a  Lumpy  Adjustment  Process)  Consider  Ay,  in  f/ie  simple  lumpy  adjustment  model  (8) 
and  x  <te/inerf  in  (/4).  77ien  x  =  (1  -  X)/X.21 


2 'More  generally,  if  the  number  of  periods  between  consecutive  adjustments  are  i.i.d.  with  mean  m,  thent  =  m—  1.  What  follows 
is  the  particular  case  where  interarrival  times  follow  a  Geometric  distribution. 
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Proof  dAyt+k/dAy*  is  equal  to  one  when  the  unit  adjusts  at  time  t  +k,  not  having  adjusted  between  times  t 
and  t  +  k—1,  and  is  equal  to  zero  otherwise.  Thus: 


/*  =  E, 


dAyl+k 
dAy; 


Prfe+*  =  1  ,  £,+*_,  =  ^+,_2  =  ...  =  &=  0}  =  X(l  -  X)k. 


(31) 


The  expression  for  x  now  follows  easily. 


Proposition  A3  (x  for  a  Process  With  Time-to-build  and  Lumpy  Adjustments)  Consider  the  process  Ayt 
with  both  gradual  and  lumpy  adjustments: 


ith 


K  K 

Ay,  =  Yj  tyk&yt-k  +  (1  -  X  <WAy,, 

A=l  k=l 


ll-l 

k=0 


(32) 


(33) 


where  Ay*  is  i.i.d.  with  zero  mean  and  variance  O2. 
Define  x  by: 


Then: 


Zjt>o^E, 

SAy; 

Si>oE 

dby,+k 
3Ay,- 

Proof  Note  that: 


E, 


9Ay, 


+k 


[  dAy; 


3Ayr+*  9  Ay, 


+j 


y  E 

;e0    '  [d&yt+j   dAyf 


k 

SB, 

7=0 


Gt_j 


Ay, 


+J 


Ay,* 


2,Gk-jHJt  (34) 

;=0 


where,  from  Proposition  Al  and  (31)  we  have  that  the  Gk  are  such  that  G(z)  =  £*>o<j*z*  =  l/Q>(z),  and 
Hk  =  X{\  -  X)k.  Define  H(z)  =  E*>ofl*Z*  and  I{z)  =  G{z)H(z).  Noting  that  the  coefficient  of  z*  in  the 
infinite  series  I(z)  is  equal  to  7*  in  (34),  we  have: 

r(i)     g'(i)    //'(i)      sf=1^      i-A 

""    7(1)    ■"    G(l)+//(1)    ■■   l-EJ^fe  A     ■ 
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B     Bias  Results 

B.l     Results  in  Section  2 

In  this  subsection  we  prove  Proposition  2  and  Corollary  1.  Proposition  1  is  a  particular  case  of  Proposition  3, 
which  is  proved  in  Section  B.2.  The  notation  and  assumptions  are  the  same  as  in  Proposition  A3. 

Proof  of  Proposition  2  and  Corollary  1    The  equation  we  estimate  is: 

Ay,  =  Y,  akAyt-k  +  v,.  (35) 

*=i 

while  the  true  relation  is  that  described  in  (32)  and  (33). 

An  argument  analogous  to  that  given  in  Section  2.2  shows  that  the  second  term  on  the  right  hand  side 
of  (32),  denoted  by  w,  in  what  follows,  is  uncorrected  with  Ay,_£,  k  >  1.  It  follows  that  estimating  (35)  is 
equivalent  to  estimating  (32)  with  error  term 

*»  =  =  (i-Efo&XAtf-*. 

and  therefore: 

f   4>;.     ifk=l,2,-,K, 

plimr_„<5it  =   < 

[   0      ifk  =  K+l. 

The  expression  for  pUm^^^t  now  follows  from  Proposition  A3.     | 

B.2     Results  in  Sections  3  and  4 

In  this  section  we  prove  Propositions  3,  4,  and  5.  The  notation  and  assumptions  are  those  in  Proposition  3. 
The  proof  proceeds  via  a  series  of  lemmas.  Propositions  3  and  5  are  proved  in  Theorem  Bl.  while  Proposi- 
tion 4  is  proved  in  Corollary  Bl. 

Lemma  Bl  Assume  X]  and  Xj  are  i.i.d.  geometric  random  variables  with  parameter  X,  so  that  Pr{X  =  k}  = 
X(l  -X)k-\k=  1,2,3,...  Then: 

m)  =  I 

Vkrpf,]     =     ^. 

In  particular,  the  I, - ,  's  (defined  in  the  main  text)  are  all  geometric  random  variables  with  parameter  X. 
Furthermore,  /,.,  and  lj  s  are  independent  ifi  ^  j. 

Next  define,  for  any  integer  s  larger  or  equal  than  zero: 


Ml=(°.fy 

[  min(Xi  —s 


ifXx  <  s, 
-s,X2)     ifXx>s. 


25 


Then 

EIM-('-«S 


Proof  The  expressions  for  E[X;]  and  Var[X,]  are  well  known.  The  properties  of  the  /,/s  are  also  trivial. 
To  derive  the  expression  for  E[Ms],  denote  by  F(k)  and  f(k)  the  cumulative  distribution  and  probability 
functions  common  to  X\  and  X2-  Then: 

Pr{M,  =  k}  Pr{X,  -  s  =  k,X2  >  k  + 1}  +  Pt{X2  =  k,Xt  -5  >  k+  1}  +  Pr{Xi  -s  =  Jfe,X2  =  it} 

=    /(J  +  /:)[l-F(A)]+/(*)[l-F(*  +  5)]+/W/(fc  +  5), 

and,  since  1  —  F(fc)  =  (1  —A)*,  with  some  algebra  we  obtain: 

Pr{M,  =  A-}=A(2-A)(l-X)s+2i-2. 

Using  this  expression  to  calculate  E[MS]  via  X;>i  &Pr{Mj  =  k}  completes  the  proof.     | 

Lemma  B2  For  any  strictly  positive  integer  s, 

Cov(&yiihAyu+s)  =  -(l-X)snl 
Proof  We  have: 

Cov(Ay,+,;Ay,)     =    E[Ay,+sAyt]  -  n\ 

=     Yj  XEiA>''+^A>v|/<+5  =  q,l,=k£t+s  =  l£t  =  l}T?r{lt+s  =  q,l,=k&+s=  1,^  =  1}  -  & 

ft=lg=l 

where  in  the  second  (and  only  non-trivial)  step  we  add  up  over  a  partition  of  the  set  of  outcomes  where 
Ay,+sAy,  +  0. 

The  expression  above,  combined  with: 

(  XA{l-X)k+i-2.     ifq=l,...,s-l, 

Pr{lt+s  =  q,h=k&+s  =  l&  =  l}  =  I 

{  \3(I-X)k+1-2,    if q  =  s, 

and  some  patient  algebra  completes  the  proof.     | 

Lemma  B3  For  q^r  and  any  integer  s  larger  or  equal  than  zero  we  have: 

Cov(AyV+J,Avr,)  =  ^L.(l_A)'ai 
Proof  Denote  v,-,  =  Ay*,  =  vf  +v'ir  Then: 


E[Ay9.,+JAyr.,|/,y.,+_v,/r.,]     =     E[^.;+5(    £    vg;/+5_7)^(  J)  vr>r_jt)|^:,+J,Zr,,] 
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iqJ+!-urJ-y 
=    *    1    I  E[4_/_t] 

j=0     i=0 

=    X2lqj+slrJn2A  +  X2Ms(lqj+s,lrJ)crA. 

Where  the  first  identity  follows  from  the  definition  of  the  Ay,/s,  the  second  from  conditioning  on  the  four 
possible  combinations  of  values  of  ^i(+5  and  t,rJ,  and  Ms(lql+S,lrl)  denotes  the  random  variable  analogous 
to  Ms  in  Lemma  Bl,  based  on  the  i.i.d.  geometric  random  variables  lqit+s  and  lrl.  The  remainder  of  the  proof 
is  based  on  the  expressions  in  Lemma  Bl  and  straightforward  algebra.     | 

Lemma  B4  We  have: 

Var(Ay,,,)  =  -i-^ \rA  +  <rA  +  a2. 

Proof  The  proof  is  based  on  calculating  both  terms  on  the  right  hand  side  of  the  well  known  identity: 

Var(Ay,,,)  =  Var^Ay,,,!/,,])  +E/„(Var(A.yM|/,,)).  (36) 

Since  [Ay,,,  |/,-.,]  has  mean  IjjUa  and  variance  /,,,c>2,  we  have: 

E[Ay2],|J,v]=E[^,(  X  v,,,_,)2HA(/,,(c^  +  c2)  +  /,,M.l 

7=0 

A  similar  calculation  shows  that: 

E  [Ay,-.,  |  /,-,,]  =\lunA. 

Hence: 

Var(Ay,,  |/„ )  =  X/,- , (oj  +  a2 )  +  X(  1  -  \%& 

and  taking  expectation  with  respect  to  /,-.,  (and  using  the  expressions  in  Lemma  Bl)  leads  to: 


E,u Var(Ay„|/„)  =  O2  +  O2  +  (1      X)^'  %\ 


2 


An  analogous  (and  considerably  simpler)  calculation  shows  that: 

Var,„(E[Ay,,|/,-,])  =  (l-A)/4  (38) 

The  proof  concludes  by  substituting  (37)  and  (38)  in  (36).     I 


Lemma  B5  For  N  >  1 . 


Var(A}A)  =  l|[i  +  _A_(yV_1)]a2+G2  +  ^l_A)/i2 


Proof  The  case  A'  =  1  corresponds  to  Lemma  B4.  For  N  >  2  we  have: 


Var(Ayf)  -1  £  £  Cov(Ay,„A>V,) 
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=     ^  {/VVar(Ayu)  +N(N  -  l)Cov(Ay,,„  Ay2,,)}  . 

The  first  identity  follows  from  the  bilinearity  of  the  covariance  operator,  the  second  from  the  fact  that 
Var(Ay,i/)  does  not  depend  on  i  and  Cov(Ay,ir,  Ay,-^),  i  /  j,  does  not  depend  on  i  or  j. 

The  remainder  of  the  proof  follows  from  using  the  expressions  derived  in  Lemmas  B3  and  B4.     | 

Lemma  B6  Recall  that  in  the  main  text  we  defined  Ay*  =  'Z'iL]  Ay*, /TV.  We  then  have: 

Var(Ay,*)     =     0*  +  ^, 

Cov(Ayf,Ay;)     =     M°A  +  f]- 
Proof  The  proof  of  the  first  identity  is  trivial.  To  derive  the  second  expression  we  first  note  that: 

'u-l 

Cov^A^,)     =    E&(  Y  Av*(_,)Ay},]  -  fi 

k=0 

X  Efc(  X  Ay*,_,)Ay},  |  /,,,  =  k^,,  =  1]A2(1  -  X)k^  -  & 

k>\  k=0 

*?  S  ti°A  +  5,-jo?  +  fi)  +  (k-l  )&]  ( 1  -  X)k~ '  -  A 

k>\ 

=    A(a^  +  8,.Jo/2), 
with  5y  =  1  if  i  =  j  and  zero  otherwise.  The  expression  for  Cov(A_vf  ,  Ay*)  now  follows  easily.     | 

Theorem  Bl  Ay?  follows  an  ARMA(  1,1 )  process:22 

Ay,  -  if  Ay, _]  =  e,  -6e,_i, 
where  £,  denotes  the  innovation  process  and 

1 


_  2  +  \{2-X){K-\) 
l-X 

Also,  as  N  tends  to  infinity  9  converges  to  zero,  so  that  the  process  for  Ayy  approaches  an  AR(1)  (and  we 
recover  Rotemberg's  (1987)  result). 
We  also  have: 


TV 


{^l^-Dol-^ii-Xr.         5=1,2,. 


22  We  have  that  0  =  9,  so  that  the  process  reduces  to  white  noise,  if  and  only  if  N  =  1  4-  2-t-^  ^4- . 
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p?    =    --— (i-a.)*;       5=1,2,3,... 

where  G^  anrf  p^  denote  the  s-th  order  autocovariance  and  autocorrelation  coefficients  of  Ayy ,  respec- 
tively. 23 

Proof  We  have: 

N      N 

Cov(Ay,+5,Ay,)     =     £  £  Cov(Ay,,,+i,Ay;V) 
'"=1  j=1 

A'Var(Ay,,)  +N{N  -  l)Cov(Ayi,,+5,Ay2,,). 

Where  the  justification  for  both  identities  is  the  same  as  in  the  proof  of  Lemma  B5.  The  expression  for 
crj'  now  follows  from  Lemmas  B3  and  B4.  The  expression  for  the  autocorrelations  follow  trivially  using 
Lemma  B5  and  the  formula  for  the  autocovariances. 

The  expressions  we  derived  for  the  autocovariance  function  of  Ay,  and  Theorem  1  in  Engel  (1984)  imply 
that  Ay,  follows  an  ARM A(  1,1)  process,  with  autoregressive  coefficient  equal  to  1  —  X.  The  expression 
for  8  follows  from  standard  method  of  moments  calculations  (see,  for  example,  equation  (3.4.8)  in  Box 
and  Jenkins  (1976))  and  some  patient  algebra.24  Finally,  some  straightforward  calculations  prove  that  9 
converses  to  zero  as  n  tends  to  infinity.     | 


Corollary  Bl   Proposition  4  is  a  direct  consequence  oj  the  preceding  theorem. 

Proof  Part  (i)  follows  trivially  from  Proposition  3  and  the  fact  that  both  regressors  are  uncorrelated.  To 
prove  (ii)  we  first  note  that: 

h    =Cov(Ayf,-Ay{v_,1Ay;-A^_1) 

1  Var(Ay;-Ayf_,) 

From  Lemma  B6  and  the  fact  that  Ay*  and  Ay,_i  are  uncorrelated  it  follows  that: 

Cov^-Ay^.Ay'-Ay^)  X  (<%  +  #\  +  (1  -p?)Var(Ayf ), 

Var(Ay;  -  Ayfl, )  oj  +  ^  +  Var(Ay? ). 

The  expressions  derived  earlier  in  this  appendix  and  some  patient  algebra  complete  the  proof.     | 


2,The  expressions  for  plim7-_J„XA'  in  Proposition  3  follow  from  noting  thai  XN  =  1  -  p^. 
24  A  straightforward  calculation  shows  that  L  >  2,  so  that  we  do  have  |0|  <  1 . 
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